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A method of representing probabilistic aspects of quantum systems is introduced by means of a 
density function on the space of pure quantum states. In particular, a maximum entropy argument 
allows us to obtain a natural density function that only reflects the information provided by the 
density matrix. This result is applied to derive the Shannon entropy of a quantum state. The 
information theoretic quantum entropy thereby obtained is shown to have the desired concavity 
property, and to differ from the the conventional von Neumann entropy. This is illustrated explicitly 
for a two-state system. 
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In standard quantum mechanics, the information 
about physical observables is contained in the state of 
the system, which is represented by a density matrix p'^ . 
This is because the expectation of an observable F!^ in 
the state p^ is given by the trace formula 



(F) = P%fP 



(1) 



and it is through such expectations that the statistical 
properties of measurement outcomes are determined. In- 
deed, for a state we require p^ to be nonnegative and 
to have trace unity. These properties suggest that the 
density matrix can be viewed as a probability distribu- 
tion. For example, if p'i is nondegenerate, with distinct 
eigenvalues, then it admits a unique decomposition 



Pp 



= ^m.n^(x.). 



(2) 



Here, IK^Xi) denotes the normalised projection operators 
onto the eigenstates Xi of pS, and the corresponding prob- 
ability weights Wi satisfy Wi > and X)i ^^ = 1- Some 
care has to be taken with this interpretation of p^, be- 
cause in the present context the underpinnings of classical 
probability are missing, and the associated terminology 
can only be used, therefore, by analogy. Nevertheless, 
von Neumann M , in pursuit of this analogy, was led by 
a series of ingenious arguments involving the thermody- 
namics of a hypothetical gas of independent systems rep- 
resented by a weighted family of orthogonal pure states, 
to argue that the quantity 



Dv 



-P^lnpf 



(3) 



represents the entropy of the state p'i . In the example of 
the state (|2|), for instance, we have S'vn = ~ X)i '^i ln""^!: 
which is the classical information entropy associated with 
the probability distribution wi. 

It is clear, nevertheless, that the von Neumann entropy 
is inadequate for some situations. Suppose, for example, 
we make a measurement of an observable with distinct 



eigenstates Xi. Then the results of the measurement can 
be represented statistically by the state (H), where the 
weighting Wi are given by the familiar transition ampli- 
tudes taken with respect to the initial state. In this case, 
the entropy of the distribution is indeed given by S'vn, 
since we know that the measurement results in one of 
the eigenstates Xi being selected, and that the informa- 
tion gained with the knowledge of the outcome precisely 
counterbalances the entropy of the state p^. However, 
this is a special state of affairs, peculiar to the measure- 
ment problem, and there is no a priori justification for 
assuming in general, given p^, that the system is in one 
or another of the eigenstates of p^. In fact, for a given 
p^, the implied minimal information distribution on the 
space of pure states is of a more general character, as we 
shall demonstrate in what follows. 

In this article we introduce a more realistic formula for 
the entropy of a quantum state. Our expression for the 
quantum entropy is in line with that of Shannon; as a 
consequence, many of the standard results for classical 
information entropy apply. The quantum entropy intro- 
duced here differs, in general, from the von Neumann 
entropy. However, like the von Neumann entropy, the 
new entropy can be expressed in terms of the eigenval- 
ues of the density matrix, as we shall illustrate explicitly 
in the case of a system characterised by a two dimen- 
sional Hilbert space. Our methodology has the advan- 
tage that it more satisfactorily takes into account the 
significance of information in modern quantum theory 
B. Indeed, whereas von Neumann specifically accom- 
modates into his thermodynamic analysis as extra infor- 
mation the assumption that the ensemble is composed 
of a weighted system of pure states, each one of which 
belongs to a given complete family of orthogonal pure 
states, we make no such assumption here. Instead, in 
our approach to the quantum entropy problem, we shall 
be guided by information theoretic principles. 

The other ingredient at our disposal, missing in von 
Neumann's theory, is the recognition that the space of 
pure states in quantum theory has the structure of a 



phase space; that is to say, it admits a natural symplec- 
tic structure. The quantum phase space F is a complex 
projective space endowed with a Hermitian correlation 
between points and hyperplanes. A point x € F repre- 
sents a pure state, i.e., an equivalence class of wave func- 
tions belonging to the same ray in Hilbert space. When 
viewed as a real manifold, F is known to have a natural 
Riemannian geometry, given by the Fubini-Study metric, 
which has a compatible symplectic structure associated 
with it Q . A typical quantum observable Fo is given by 
a function F{x) on F of the form 



Fix) 



Mx)F^i^'^(x) 



(4) 



where il)'^(x) denotes any wave function in the equiva- 
lence class associated with the pure state x. With a slight 
departure from the traditional terminology we can refer 
to the function F{x) itself as the observable. Then \iF{x) 
and G{x) are observables, their Poisson bracket with re- 
spect to the symplectic structure is also an observable, 
given by i times the expectation of the commutator of 
the corresponding operators, taken in the pure state x. 
The resulting algebra of quantum observables gives F the 
structure of a Poisson manifold, and as a consequence the 
Schrodinger trajectories of pure states are given by the 
integral curves of the symplectic vector field for which 
the generator H{x) is the quantum Hamiltonian. 

We shall take the view here that a general quantum 
state is represented by a density function p{x) on F, sat- 
isfying p{x) > and 



p{x)dV 



1, 



(5) 



where dV is the volume element associated with the 
Fubini-Study metric. Thus we can think of p{x) as an en- 
semble on the phase space F. For example, let us consider 
the measurement of an observable F{x) with distinct 
eigenstate xi, when initially the system is in a given pure 
state xq. Then for the density function corresponding 
to an ensemble consisting of a large number of indepen- 
dent identical copies of the system we can write p{x) = 
S{x, xq) for the initial state, and p{x) = J2i ^i^ix, Xi) af- 
ter the measurement has been performed. Here 5{x^Xi) 
denotes a delta function on F, concentrated at the point 
Xi, and Wi is the transition amplitude between the states 
xo and Xi. The expectation of an observable F{x) in the 
general state p{x) is then given by 



(^> 



F{x)p{x)dV. 



(6) 



We can regard (P) as equating (F) with the uncondi- 
tional expectation of the conditional expectation F{x) 
in the pure state x. The dynamical evolution of p{x) 
is governed by the Liouvillc equation, where the Pois- 
son bracket between p{x) and H[x) is determined by the 



symplectic structure on F. If p(x) is initially given by a 
delta function concentrated on a single pure state, then 
subsequently it remains of that form, and the point of 
concentration follows a Schrodinger trajectory. 

Now, suppose we introduce the projection operator 



m{x) 



Tl)f3{x)'lp"'{x) 

^P^{x)iP^ix) 



(7) 



corresponding to the pure state represented by a generic 
point X (z F. Then, the general quantum state can be 
expanded in terms of its moments M. In particular, the 
lowest moment of 11 0(2:) in the state p{x) gives rise to 
the density matrix of ordinary quantum mechanics: 



Pp 



p{x)n'§{x)dv. 



(8) 



It follows from the formulae above that the expectation 
(0) agrees with the standard trace formula (El), provided 
F{x) is a linear observable of the form (Q), that is, F{x) — 
Fp'Il^{x). An advantage of the general expression (0) is 
that it can also be applied in the case of a nonlinear 
observable of the Kibble- Weinberg type [||. It should 
be emphasised nevertheless that when we consider the 
statistical properties of ordinary linear observables, this 
formulation of quantum mechanics on F is equivalent to 
the conventional Hilbert space approach. 

Under suitable technical conditions the information in 
the state p{x) can be represented by the totality of its 
moments, and a unique expansion of the form 



p{x) = 1 + p'^^n^^ix) + A.^;^;n^(x)nf,(x) 



(9) 



exists, where the /x-coefBcients are trace-free and totally 
symmetric. A calculation then shows that the n-th co- 
efficient is given, up to a combinatorial factor, by the 
trace- free part of the n-th moment of H^ (x) . It follows 
that the density matrix of ordinary quantum mechanics 
in general does not contain all of the information about 
the state of the system. This remains the case a fortiori 
if we relax the technical conditions and allow p{x) to be- 
long to a broader class of measures. However, if we wish 
to consider the statistical properties of linear observables, 
then, owing to formula (m) , it suffices to consider the den- 
sity matrix exclusively. Because our intention here is to 
investigate the entropy in ordinary quantum mechanics, 
we shall therefore examine the consequences of assum- 
ing that the information encoded in the density matrix 
is the only information available to us. In this context 
it is worth recalling the work of Mielnik Q , who regards 
the state in ordinary quantum theory as an equivalence 
class of density functions each of which gives rise to the 
same density matrix. We note, however, that there is 
a subtle deficiency in his approach, because it treats all 
distributions that give rise to the same density matrix 
on an equal footing. Clearly, some distributions contain 
more information than others, and according to the gen- 
eral principles of information theory we must look for the 



distribution that is least informative, subject to the con- 
dition that it is consistent with the prescribed density 
matrix. 

It should be evident from the foregoing discussion that 
the appropriate expression for the Shannon entropy of a 
quantum state p{x) is 



Sp = — I p{x) In p{x)dV. 



(10) 



Because p{x) is a probability density function defined on 
the smooth manifold F, it follows that Sp possesses the 
standard properties of the Shannon entropy. The ques- 
tion we have to address here is thus: given a density ma- 
trix p'a, how do we express the corresponding quantum 
entropy Sp in terms of it? Clearly, for a generic density 
matrix, there exist many different density functions p{x) 
that give rise to the same p^. Therefore, it is not ob- 
vious which p{x) we should select. This problem can be 
resolved by recalling our assumption that the density ma- 
trix is the only information available to us. This implies 
that the relevant density function p{x) is the one with 
minimum information, or maximum entropy Sp, subject 
to the constraint (pi). If we let A2 denote the Lagrange 
multiplier required for this extremisation problem, then 
the solution is a distribution of the canonical form 



p{x) = exp(-AX(^)-ln^(A)), 



(11) 



where the normalisation is given by the generating func- 
tion 



Z{X) 



j^e^j>{~\ini{x))dV. (12) 



The Lagrange multiplier Xi is determined, up to an ar- 
bitrary trace term, by the constraint 



d\nZ{\) 



Pp- 



(13) 



The result (llll) is perhaps surprising because in the liter- 
ature of quantum theory the canonical distribution func- 
tion arises typically in the thermal context. 

It follows from the expression for the minimum infor- 
mation distribution function that the quantum Shannon 
entropy associated with the density matrix p'^ is given by 
a Legendre transformation 



Sp = A^p(^+lnZ(A), 



(14) 



where A^ is determined by the relation (g^. Alterna- 
tively, we can combine dl3) and (04) and define Sp ac- 
cording to the scheme 



Sp = sup(A^p;^ + lnZ(A)). 



(15) 



In fact, one can show that InZ(A) is convex on the vec- 
tor space obtained by eliminating the trace of X^. The 



argument, as we indicate below, is reminiscent of the 
reasoning used to demonstrate the positivity of the heat 
capacity in statistical mechanics. It follows that InZ(A) 
is the convex dual of the entropy, and that Sp is concave 
over the space of density functions. More specifically, we 
find that 



d'^lnZ 

dxldxi, 



p{x){li1-pl){IiJ~pJ)dV, (16) 



which shows that the Hessian of hiZ{\) is given by 
the covariance of the projection operator Ho (a;), which 
is positive definite for trace-free displacements in the 
value of A^. Indeed, the Hessian is independent of A^, 
since under the transformation A^ ^ A^ -t- p5'^ we have 



Z{X) -^ e-i'ZiX). It thus follows that (|T|) defines a Rie- 
mannian metric, known as the Fisher-Rao metric, on the 
parameter space of the distribution (11). Therefore, by 
convex duality , we conclude that Sp is concave in the 
sense that if p^(i) are density matrices for i = 1, 2, • ■ • , n 
and if {wi} is a set of probability weights, then 



'^w,p'jl{i 



> 



J2w,Sp[p'^p{{)] 



(17) 



where Splpg] denotes the entropy ( p^ associated with a 
given density matrix p'i. 

This is our main result for the quantum entropy. To 
see that Sp differs from S'vn as a function of pp we pro- 
ceed as follows. Suppose, on the contrary, that there 
exists a constant A, independent of pg, such that Sp = 
S'vn + In^. Then solving for p^ by use of (|) and (|l|) 
we obtain p^ — Aexp(— A^)/Z(A), which implies that 
/^H^exp(-A^n^(a;))rfF = Aexp(-A;^) holds for all A;^. 
Expanding each side to first order in A « , we reach a con- 
tradiction. 

We have demonstrated that if the information available 
at our disposal is given solely by the density matrix p^, 
then the corresponding entropy is given by (|lj). Con- 
versely, any other form of entropy, such as that of von 
Neumann, implies the knowledge of information other 
than Pg, even if the entropy itself can be expressed in 
terms of p'^. Hence, in a strict sense, any other choice 
of entropy goes beyond the category of linear quantum 
mechanics, as is consistent with the fact that the von 
Neumann entropy gives the correct result in the case of 
a measurement outcome. This implication is implicit in 
the extremisation procedure used to obtain the probabil- 
ity distribution (pT|). 

Given expression ( [l4|) for the quantum entropy, it is 
not readily obvious how Sp depends on the eigenvalues 
of Pg . In order to see this, all we require is the generating 
function Z{X) in (|l2|). As an illustration, let us consider 
the case of a two state system. We choose the basis 
where the density matrix is diagonal, with elements pi 
and p2 = 1 — pi. Because Xg commutes with pg, in 
this basis Xg is also diagonal, with eigenvalues Ai and 



A2. The F-space integration for the generating function 
Z(X) can be hfted to C^ with a spherical constraint on 
TJj°'{x). The integration involves a Gaussian (cf. [g|), and 
we obtain Z(A) = {2TTf{e"^^ 
which it follows that 



e~'^i)(Ai - A2)"~, from 



Pi 



Ai — A2 



1 



dAi — A2 



(18) 



Then because the dependence on AS is only up to the 
eigenvalue difference, we can set A2 = A and Ai = —A. 
With these expressions at hand, we can compare the 
quantum entropy with the von Neumann entropy. The 
qualitative behaviours of Sp{X) and 5'vn(A) in this ex- 
ample turn out to be similar, though not identical, as 
illustrated in Fig. 1, where we compare plots for the A- 
derivatives of the two entropies. The two curves agree in 
the pure-state limits A -^ ±c». 




FIG. 1. The plots for the entropy derivative dSp/dX 
(lower-right curve) and the corresponding derivative dS^jsi/dX 
(upper-right curve) for the von Neumann entropy, for a 
two-state quantum system, showing that the two entropies 
have qualitatively similar behaviour. 

Although we have only shown explicit results for a two- 
state system, it is worth remarking that the F-space inte- 
gration dl2) for the general generating function is invari- 
ably a Gaussian, and that the derivation of the entropy 
thus remains tractable for all finite dimensionalities. 

In summary, we have introduced the idea of a proba- 
bility density function p{x) on the space of rays through 
the origin of the Hilbcrt space that only reflects the infor- 
mation provided by the density matrix. Based upon this 
we were able to obtain the Shannon entropy for a quan- 
tum state, which, from an information theoretic point of 
view, is superior to von Neumann's proposal for the en- 
tropy. The utility of the distribution (O) does not exclu- 
sively reside, however, in studying the entropy of quan- 
tum states. In fact, it can be applied to numerous other 
probabilistic and information theoretic aspects of quan- 
tum mechanics, as well as quantum estimation theory. 
For example, the Lagrange multiplier A^ in the foregoing 



analysis can be viewed as parameterising the quantum 
state p'i of the system. Then, in the problem of esti- 
mating an unknown quantum state p , it is of interest to 
consider the Fisher information matrix which determines 
the variance lower bound (cf. pi|). In the present con- 
text, this is given by the Hessian ( [l6| ) of the generating 
function InZ(A), which can be computed explicitly for a 
given po. The use of the minimal information state p(x) 
can also be applied to the theory of quantum communi- 
cation. We hope that the approach introduced here will 
offer further insights into the understanding of quantunr 
theory. 
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